Using interdocument similarity information in document retrieval systems
نویسندگان
چکیده
منابع مشابه
Using interdocument similarity information in document retrieval systems
The first part of this paper reports a comparative study of the document classifications produced by the use of the single linkage, complete linkage, group average, and Ward clustering methods. Studies of cluster membership and of the effectiveness of cluster searches support previous findings that suggest that the single linkage classifications are rather different from those produced by the o...
متن کاملDocument Retrieval using Predication Similarity
Document retrieval has been an important research problem over many years in the information retrieval community. State-of-the-art techniques utilize various methods in matching documents to a given document including keywords, phrases, and annotations. In this paper, we propose a new approach for document retrieval that utilizes predications (subject-predicate-object triples) extracted from th...
متن کاملWeb Document Retrieval Using Sentence-Query Similarity
For the web document retrieval experiments in our TREC '2002 participation, we used two new methods. One is the use of anchor texts, which has been advocated by many researchers. But the methods used by them is different from our method. The second is the use of sentence-query similarity. It has been known that the use of links for web retrieval did not show impressive improvement in performanc...
متن کاملOptimizing Document Similarity Detection in Persian Information Retrieval
Most data on the Web is in the form of text or image. Finding desired data on the Web in a timely and cost-effective way is a problem of wide interest. In the last several years, many search engines have been created to help Web users find desired information. In this paper we present a new technique to eliminate the affixes and their effects on recognizing similar Persian documents. Reviewing ...
متن کاملInformation Retrieval Systems for Large Document Collections
Practical information retrieval systems must manage large volumes of data, often divided into several collections that may be held on separate machines. Techniques for locating matches to queries must therefore consider identiication of probable collections as well as identiication of documents that are probable answers. Furthermore , the large amounts of data involved motivates the use of comp...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of the American Society for Information Science
سال: 1986
ISSN: 0002-8231,1097-4571
DOI: 10.1002/asi.4630370102